electronic medical record
Liver Cancer Knowledge Graph Construction based on dynamic entity replacement and masking strategies RoBERTa-BiLSTM-CRF model
Zhang, YiChi, Wang, HaiLing, Gao, YongBin, Hu, XiaoJun, Fan, YingFang, Fang, ZhiJun
Background: Liver cancer ranks as the fifth most common malignant tumor and the second most fatal in our country. Early diagnosis is crucial, necessitating that physicians identify liver cancer in patients at the earliest possible stage. However, the diagnostic process is complex and demanding. Physicians must analyze a broad spectrum of patient data, encompassing physical condition, symptoms, medical history, and results from various examinations and tests, recorded in both structured and unstructured medical formats. This results in a significant workload for healthcare professionals. In response, integrating knowledge graph technology to develop a liver cancer knowledge graph-assisted diagnosis and treatment system aligns with national efforts toward smart healthcare. Such a system promises to mitigate the challenges faced by physicians in diagnosing and treating liver cancer. Methods: This paper addresses the major challenges in building a knowledge graph for hepatocellular carcinoma diagnosis, such as the discrepancy between public data sources and real electronic medical records, the effective integration of which remains a key issue. The knowledge graph construction process consists of six steps: conceptual layer design, data preprocessing, entity identification, entity normalization, knowledge fusion, and graph visualization. A novel Dynamic Entity Replacement and Masking Strategy (DERM) for named entity recognition is proposed. Results: A knowledge graph for liver cancer was established, including 7 entity types such as disease, symptom, and constitution, containing 1495 entities. The recognition accuracy of the model was 93.23%, the recall was 94.69%, and the F1 score was 93.96%.
- Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
TRIALSCOPE: A Unifying Causal Framework for Scaling Real-World Evidence Generation with Biomedical Language Models
González, Javier, Wong, Cliff, Gero, Zelalem, Bagga, Jass, Ueno, Risa, Chien, Isabel, Oravkin, Eduard, Kiciman, Emre, Nori, Aditya, Weerasinghe, Roshanthi, Leidner, Rom S., Piening, Brian, Naumann, Tristan, Bifulco, Carlo, Poon, Hoifung
The rapid digitization of real-world data offers an unprecedented opportunity for optimizing healthcare delivery and accelerating biomedical discovery. In practice, however, such data is most abundantly available in unstructured forms, such as clinical notes in electronic medical records (EMRs), and it is generally plagued by confounders. In this paper, we present TRIALSCOPE, a unifying framework for distilling real-world evidence from population-level observational data. TRIALSCOPE leverages biomedical language models to structure clinical text at scale, employs advanced probabilistic modeling for denoising and imputation, and incorporates state-of-the-art causal inference techniques to combat common confounders. Using clinical trial specification as generic representation, TRIALSCOPE provides a turn-key solution to generate and reason with clinical hypotheses using observational data. In extensive experiments and analyses on a large-scale real-world dataset with over one million cancer patients from a large US healthcare network, we show that TRIALSCOPE can produce high-quality structuring of real-world data and generates comparable results to marquee cancer trials. In addition to facilitating in-silicon clinical trial design and optimization, TRIALSCOPE may be used to empower synthetic controls, pragmatic trials, post-market surveillance, as well as support fine-grained patient-like-me reasoning in precision diagnosis and treatment.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > Canada (0.04)
- (3 more...)
- Research Report > Strength Medium (1.00)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Research Report > Strength High (0.93)
Application of Transformers based methods in Electronic Medical Records: A Systematic Literature Review
Batista, Vitor Alcantara, Evsukoff, Alexandre Gonçalves
The combined growth of available data and their unstructured nature has received increased interest in natural language processing (NLP) techniques to make value of these data assets since this format is not suitable for statistical analysis. This work presents a systematic literature review of state-of-the-art advances using transformer-based methods on electronic medical records (EMRs) in different NLP tasks. To the best of our knowledge, this work is unique in providing a comprehensive review of research on transformer-based methods for NLP applied to the EMR field. In the initial query, 99 articles were selected from three public databases and filtered into 65 articles for detailed analysis. The papers were analyzed with respect to the business problem, NLP task, models and techniques, availability of datasets, reproducibility of modeling, language, and exchange format. The paper presents some limitations of current research and some recommendations for further research.
- North America > United States > New York > New York County > New York City (0.05)
- South America > Brazil > Rio de Janeiro > Rio de Janeiro (0.05)
- North America > Canada > British Columbia > Vancouver Island > Capital Regional District > Victoria (0.04)
- (14 more...)
- Research Report > Experimental Study (1.00)
- Overview (1.00)
- Research Report > New Finding (0.68)
AI Expert: We Should Stop Using So Much AI
Meredith Broussard is unusually well placed to dissect the ongoing hype around AI. She's a data scientist and associate professor at New York University, and she's been one of the leading researchers in the field of algorithmic bias for years. And though her own work leaves her buried in math problems, she's spent the last few years thinking about problems that mathematics can't solve. Her reflections have made their way into a new book about the future of AI. In More than a Glitch, Broussard argues that we are consistently too eager to apply artificial intelligence to social problems in inappropriate and damaging ways. Her central claim is that using technical tools to address social problems without considering race, gender, and ability can cause immense harm.
Meet the AI expert who says we should stop using AI so much
Broussard has also recently recovered from breast cancer, and after reading the fine print of her electronic medical records, she realized that an AI had played a part in her diagnosis--something that is increasingly common. That discovery led her to run her own experiment to learn more about how good AI was at cancer diagnostics. We sat down to talk about what she discovered, as well as the problems with the use of technology by police, the limits of "AI fairness," and the solutions she sees for some of the challenges AI is posing. The conversation has been edited for clarity and length. At the beginning of the pandemic, I was diagnosed with breast cancer.
The future of healthcare and how technological breakthroughs will impact it
With the help of a variety of cutting-edge technologies, such as telemedicine, electronic medical records, home-based care transitioning from hospital-based care, drone technology, genome sequencing, digital tools, and artificial intelligence (AI), the healthcare sector has been transforming drastically. Undoubtedly, the pandemic accelerated the acceptance and advancement of technology in healthcare. Patients can now obtain medical care more quickly and easily outside of the typical hospital setting, improving convenience and accessibility for everyone. Furthermore, the exponential growth of the diagnostic sector has contributed to the growth in overall healthcare industry in India. The current situation has been significantly changed by modern and high-end diagnostics, which have replaced the conventional ways of diagnosis with new age, digital-led infrastructures backed by AI and ML.
End-to-end Clinical Event Extraction from Chinese Electronic Health Record
Feng, Wei, Huang, Ruochen, Yu, Yun, Sun, Huiting, Liu, Yun
Event extraction is an important work of medical text processing. According to the complex characteristics of medical text annotation, we use the end-to-end event extraction model to enhance the output formatting information of events. Through pre training and fine-tuning, we can extract the attributes of the four dimensions of medical text: anatomical position, subject word, description word and occurrence state. On the test set, the accuracy rate was 0.4511, the recall rate was 0.3928, and the F1 value was 0.42. The method of this model is simple, and it has won the second place in the task of mining clinical discovery events (task2) in the Chinese electronic medical record of the seventh China health information processing Conference (chip2021).
Why natural language processing (NLP) is a critical part of your real world data strategy
It's time to unleash the value of unstructured data The flow of relevant real world data (RWD) to the life sciences industry has exploded in recent years, and most (80%) of that new content comes in the form of unstructured data. Unstructured data includes all the information that is shared in textual narrative and conversational formats. And in the age of Twitter and chatbots, that covers quite a bit of ground. This information can be drawn from social media posts, journal articles, virtual customer contact requests, telehealth conversations, medical notes, and research information – and that is just the short list. This content can be rich in value, but that value is often underutilized because these documents are difficult to manually review, translate, and analyze.
Embracing AI When Your Industry Is in Flux
One of the great challenges we have seen businesses face in recent years is how they approach data and analytics (and now artificial intelligence) when their industries are undergoing major transformation. It's hard enough to create a data-driven culture, compete on analytics, develop data-driven products and services, and so forth under normal business conditions, as we noted in our March column about the newest NewVantage Partners survey on big data and AI. But doing it while your business and industry are transforming -- the old line of changing out a jet engine while the plane is flying through turbulence at 35,000 feet -- is really tough. It's so difficult, in fact, that we always have our doubts when executives claim to have done it successfully. We are much more trusting when we're told that the organization is simply making progress toward the goal.
- Information Technology > Artificial Intelligence (1.00)
- Information Technology > Data Science > Data Mining (0.36)
Synthesizing time-series wound prognosis factors from electronic medical records using generative adversarial networks
Foomani, Farnaz H., Anisuzzaman, D. M., Niezgoda, Jeffrey, Niezgoda, Jonathan, Guns, William, Gopalakrishnan, Sandeep, Yu, Zeyun
Wound prognostic models not only provide an estimate of wound healing time to motivate patients to follow up their treatments but also can help clinicians to decide whether to use a standard care or adjuvant therapies and to assist them with designing clinical trials. However, collecting prognosis factors from Electronic Medical Records (EMR) of patients is challenging due to privacy, sensitivity, and confidentiality. In this study, we developed time series medical generative adversarial networks (GANs) to generate synthetic wound prognosis factors using very limited information collected during routine care in a specialized wound care facility. The generated prognosis variables are used in developing a predictive model for chronic wound healing trajectory. Our novel medical GAN can produce both continuous and categorical features from EMR. Moreover, we applied temporal information to our model by considering data collected from the weekly follow-ups of patients. Conditional training strategies were utilized to enhance training and generate classified data in terms of healing or non-healing. The ability of the proposed model to generate realistic EMR data was evaluated by TSTR (test on the synthetic, train on the real), discriminative accuracy, and visualization. We utilized samples generated by our proposed GAN in training a prognosis model to demonstrate its real-life application. Using the generated samples in training predictive models improved the classification accuracy by 6.66-10.01% compared to the previous EMR-GAN. Additionally, the suggested prognosis classifier has achieved the area under the curve (AUC) of 0.975, 0.968, and 0.849 when training the network using data from the first three visits, first two visits, and first visit, respectively. These results indicate a significant improvement in wound healing prediction compared to the previous prognosis models.
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)